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SHARED TRANSLATION ADDRESS CACHING 

BACKGROUND 

The invention relates to caching in memory controller 

hubs . 

Microcomputer systems generally include one or more 

memory controller hubs that control and coordinate the 

transfer of data between the computer's system memory, central 

processing unit (CPU) , and peripheral devices. Graphics 

applications may be supported by peripheral devices known as 

graphics controllers that require a memory controller hub to 

10 transfer data between the graphics controller, the system 

memory, and the CPU. 

A design concern associated with microcomputer systems is 

the quality of two-dimensional {2D) , three-dimensional (3D) , 

fy and video image (collectively referred to below as "graphics") 

O 15 processing. High-performance graphics processing requires 

processor- intensive calculations and the fast manipulation of 

large quantities of data. Several designs have been 

implemented to achieve high-performance graphics processing 

while also reducing the cost of the complete system and 

20 allowing for upgrades to the computer system's capability. 

A computer system may include a graphics controller 

coupled to local memory for storing graphics data, so that the 

amount of data that must be transferred between the graphics 
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controller and the system memory and/or the CPU is reduced. 
Increasing the amount of local memory available to the 
graphics controller improves graphics performance, but also 
increases the cost of the computer system, because local 
5 graphics memory is relatively expensive. Less local memory is 
required to achieve the same graphics performance, however, if 
a dedicated bus, e.g., an Accelerated Graphics Port (AGP), is 
used to couple the controller to the memory controller hub. 
An AGP allows the controller to treat portions of system 
p 10 memory as dedicated local graphics memory, which reduces the 
01 amount of local memory required and lowers overall system 

costs . 

Computer system costs also may be reduced by eliminating 
the peripheral graphics controller and integrating its 
15 functionality into the memory controller hub. In such a 

configuration the memory controller hub is better described as 
a graphics/memory controller hub, since it performs graphics 
processing functions in addition to memory control and 
transfer functions. Additionally, it includes one or more 
20 output ports to send graphics signals to external devices, 

such as cathode ray tubes (CRTs) and flat-panel monitors. A 
graphics/memory controller hub may be coupled to local memory 
for storing graphics data. 
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BRIEF DESCRIPTION OF DRAWINGS 
Figure 1 is a schematic block diagram of a computer 
system. 

Figure 2 is a schematic block diagram of a graphics 
memory controller hub. 

Fpgiire 3 is a schematic block diagram of accelerated 
graphics po^ (AGP) functionality of a graphics memory 
controller hub/* 

DETAILED DESCRIPTION 
In a computer system, a memory controller hub may be 

integrated with an internal graphics controller and may 

interface with an external graphics device through an AGP 

port. Because the memory controller hub controls both 

graphics and memory functions it is referred to as a 

graphics/memory controller hub (GMCH) . The GMCH provides both 

internal graphics processing and scalable graphics performance 

through an AGP interface. 

The GMCH may be used in one of two mutually exclusive 

modes: AGP mode ; in which case the GMCH uses its capability to 

interface with an external graphics controller and its 

internal graphics functionality is disabled; or Gfx mode, in 

which case the GMCH uses its internal graphics capability, and 

its ability to interface with an external graphics controller 

is disabled. In Gfx mode the GMCH can still interface with a 
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local memory module through the AGP port to provide additional 
graphics memory for use by the internal graphics. Whether the 
GMCH operates in AGP mode or Gfx mode can be determined 
automatically and set during the start-up sequence of the 
5 computer. 

Figure 1 illustrates an exemplary computer system 1 in 
which the GMCH can be implemented. The computer system 1 
includes a microprocessor (for example, CPU) 2 coupled to a 
GMCH 3, which contains a system memory controller hub. GMCH 3 
10 may also be referred to as a "chipset" or "core logic." GMCH 



3 provides an interface between CPU 2 and system memory 4 , and 



between CPU 2 and a bus, for example, a peripheral component 



ft- 



interconnect (PCI) or Hublink™ bus 5. Various input/output 



(I/O) devices 6 are coupled to PCI bus 5, which is coupled to 




GMCH 3 via input/output controller hub (ICH) 11. Computer 



system 1 may also include a graphics device 7, which may be a 



graphics controller coupled to local memory 8, or which may be 



an AGP Inline Memory Module (AIMM) that provides external 



local memory for the internal graphics functionality of GMCH 



20 



3 . 



A shared AGP/local memory interface 9 provides a dedicated 



interface bus between GMCH 3 and graphics device 7 . Graphics 



and video signals may be sent to a display device 10 from 



graphics device 7 if one is present in the computer system, or 
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may be sent to display device 10 from GIViCH 3 if graphics 
device 7 is absent. 

Figure 2 illustrates other details of GMCH 3, including a 
CPU interface 20 coupled to an AGP interface 21, a local 
memory interface 22, an input/output (I/O) hub interface 23, 
and a system memory interface 24. Graphics functions can be 
performed by internal graphics components 25, which include a 
data stream and dispatch controller 26 to manage the flow of 
data and various graphics engines 27 to perform graphics 
10 operations on data, 

AGP transactions are run in a 
split tr^saction fashion in which a request for data transfer 
to or from system memory 4 is disconnected in time from the 
data transfer jstself . An AGP compliant graphics device (bus 
15 master) 7a initiatSses a transaction with an access request. 

The AGP interface 2]\responds to the request by directing the 
corresponding data transfer at a later time, which permits the 
AGP graphics device 7a to\ipeline several access requests 
while waiting for data transfers to occur. As a result of 
20 pipelining, several read and/orVrite access requests may be 
simultaneously outstanding in requ\st queues 100. Access 
requests can either be pipelined across an address/data bus 
(AD bus) 105, 107 of AGP 9 or transferr^ through sideband 
address lines 107 of AGP 9 and received byVrequest queue 100. 



Docket No.: 10559-1^^1 (P8249) 

Scheduler 102 processes the access requests in request 
queue 100 >v Read data are obtained from system memory 4 and 
are returned the initiative of scheduler 102 through read 
data return queue^3s04 and across AD bus 105 of the AGP 9. 
Write data are provided by AGP compliant graphics controller 7 
at the direction of schedu^ser 102 when space is available in 
the write data queue 108. ThijX/ AGP transactions generally 
include interleaved access requesD^^ and data transfers. 

Graphics data may be stored in system memory 4 when GMCH 
3 operates in AGP mode in conjunction with an external AGP 
compliant graphics controller 7a, or when GMCH 3 operates in 
Gfx mode using its internal graphics functionality. When 
using system memory 4 to store graphics data, GMCH 3 uses a 
virtual memory addressing concept for accessing graphics data. 
In AGP mode, a 32 MB or 64 MB graphics aperture is defined 
through which addresses in the physical system memory 4 can be 
accessed by graphics controller 7a. The graphics aperture 
appears to graphics controller 7a as a 32 MB or 64 MB 
contiguous block of linear memory, although the addresses in 
physical system memory allocated for use by AGP graphics 
controller 7a are not contiguous. The contiguous block of 
memory addresses in the graphics aperture permits graphics 
controller 7a to quickly access large data structures, such as 
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texture bitmaps (typically 1 KB to 128 KB) , as single entities 
in virtual memory. 

Access requests from graphics controller 7a address 
virtual memory within the aperture range, and then GMCH 3 
forwards access requests within the aperture to physical 
system memory 4. The originally- issued addresses sent from 
graphics controller 7a are translated within data stream 
controller 26 using a Graphics Address Remapping Table (GART) . 
The GART is a table that matches virtual memory address in the 
aperture range with corresponding physical memory addresses. 
The GART is stored in system memory in a location known to 
GMCH 3 because its location is stored in a register within 
GMCH 3 . Addresses are mapped from the graphics aperture into 
system memory in 4 KB pages, and each entry of the GART 
translates one 4 KB page. Thus, when an access request is 
received from graphics controller 7a in the graphics aperture, 
the request is momentarily stalled while the appropriate GART 
entry is fetched from system memory 4. The address of the 
access request within the graphics aperture is translated 
using the fetched translation table entry, and the request is 
forwarded to the physical address in system memory 4 
identified by the fetched GART entry. 

To speed up memory access requests from AGP graphics 
controller 7a to system memory 4, GMCH 3 provides a GART entry 
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cache 28 for locally storing up to four entries from the GART. 
GART entry cache 28 may also be known as a translation 
lookaside buffer (TLB) . When a GART entry is first retrieved 
from the GART in system memory 4 to translate a virtual 
address into a physical address, the entry may be stored in 
the TLB 28 residing in data stream controller 26. The next 
time an address request from graphics controller 7a needs to 
use the same GART entry, the entry can be retrieved from the 
local TLB 2 8 rather than from the GART in distant system 
memory 4 . Since GART entries may be stored in TLB 28 and each 
GART entry provides access to a 4KB page of memory addresses, 
up to 16 KB of access requests from graphics controller 7a may 
be translated using the GART entries stored locally in TLB 28 
before a new GART entry must be retrieved from system memory 
4. If data stream controller 26 needs to use a GART entry 
that is not stored locally in TLB 28, the necessary entry may 
be retrieved from system memory 4 and then stored in TLB 2 8 
for future use, thereby replacing an entry previously stored 
in TLB 28. 

Referring again to Figure 2, in Gfx mode, the internal 
graphics engines 27 of GMCH 3 define a 64 MB logical address 
space through which addresses in the physical system memory 4 
or AIMM can be accessed by internal graphics engines 27. The 
logical address space appears to graphics controller 7a as a 
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32 MB or 64 MB contiguous block of linear memory, although the 
addresses in physical system memory 4 or AIMM allocated for 
use by internal graphics engines 27 are not contiguous. Like 
the graphics aperture used in AGP mode, the contiguous block 
5 of memory addresses in the logical address space permits 

internal graphics engines 27 to access large data structures 
quickly as single entities in virtual memory. 

Access requests from internal graphics engines 27 are 
translated within data stream controller 26 using a Graphics 
D 10 Translation Table (GTT) , which is stored in system memory in a 



location stored by GMCH 3 in a register within GMCH 3. 



Addresses within the logical address space are mapped into 



system memory or AIMM in 4KB pages, and each entry of the GTT 



translates one 4KB page. GTT entries additionally determine 




whether access requests are mapped to system memory 4 or AIMM 



memory, if an AIMM card is present. The same TLB 28 used in 



GMCH 3 to cache GART entries may be used to store up to four 



entries locally from the GTT in order to speed access to 



physical memory. Since the number of GART entries or GTT 



20 



entries that may be stored in TLB 28 is limited by the 



physical die area size of the TLB, using the same TLB to store 



GART entries in AGP mode and to store GTT entries in Gfx mode 



effectively doubles the number of GART or GTT entries that may 



be stored in TLB compared to the number that could be stored 
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if separate TLBs were used for GART and GTT entries. 
Additionally, using the same TLB to store GART entries in AGP 
mode and to store GTT entries in Gfx mode simplifies the 
internal logic of GMCH 3 because a single logic serves both 
5 functions of the TLB. 

Other embodiments are within the scope of the following 
claims. 
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